Words containing , (comma):

Rank in Wordlist | Frequency | Word |
---|---|---|
718 | 53 | %, |
3397 | 12 | 2,5 |
3664 | 11 | 1,5 |
4961 | 8 | 1,6 |
4986 | 8 | 4,5 |
5610 | 7 | 1,2 |
5611 | 7 | 1,4 |
5644 | 7 | 3,5 |
6459 | 6 | 0,3 |
6460 | 6 | 0,8 |

Words containing (:

Rank in Wordlist | Frequency | Word |
---|---|---|
19526 | 2 | დიდღე(ჲ)ეშ |
26075 | 2 | ღე(ჲ)ე |
26076 | 2 | ღე(ჲ)ეშ |
26657 | 2 | წყურგი(ჲ)ა |
27238 | 1 | $(მა-15 |
28026 | 1 | 16%(2008 |
28206 | 1 | 18(30 |
31205 | 1 | Es-Dur(KV |
31531 | 1 | ISA(Industry |
32838 | 1 | Zauberflöte)(KV |

Words containing ):

Rank in Wordlist | Frequency | Word |
---|---|---|
4023 | 10 | %) |
16273 | 2 | %), |
19526 | 2 | დიდღე(ჲ)ეშ |
21275 | 2 | კლასი). |
26075 | 2 | ღე(ჲ)ე |
26076 | 2 | ღე(ჲ)ეშ |
26539 | 2 | წ.). |
26657 | 2 | წყურგი(ჲ)ა |
27239 | 1 | $), |
27252 | 1 | %). |

Words containing %:

Rank in Wordlist | Frequency | Word |
---|---|---|
718 | 53 | %, |
2053 | 20 | %. |
3138 | 13 | 80% |
3399 | 12 | 30% |
4023 | 10 | %) |
4032 | 10 | 2% |
4043 | 10 | 90% |
4962 | 8 | 10% |
4967 | 8 | 18% |
4984 | 8 | 20% |

Words containing &:

Rank in Wordlist | Frequency | Word |
---|---|---|
32215 | 1 | R&B |
32216 | 1 | R&B-ს |
32217 | 1 | R&D |
32309 | 1 | S&M |

Words containing $:

Rank in Wordlist | Frequency | Word |
---|---|---|
7607 | 5 | $; |
11803 | 3 | $. |
16271 | 2 | $, |
16272 | 2 | $500 |
27238 | 1 | $(მა-15 |
27239 | 1 | $), |
27240 | 1 | $1 |
27241 | 1 | $150 |
27242 | 1 | $17,4 |
27243 | 1 | $2.23 |

Words containing ' (apostrophe):

Rank in Wordlist | Frequency | Word |
---|---|---|
17070 | 2 | Don't |
17125 | 2 | I'm |
18197 | 2 | ბასტი-დ'იურფეშ |
27861 | 1 | 13°9'48 |
29481 | 1 | 39°44'N |
30720 | 1 | America's |
30795 | 1 | Avañe'ẽ |
31398 | 1 | Grimey's |
31493 | 1 | I've |
31595 | 1 | Isrā'īl |

Words containing /:

Rank in Wordlist | Frequency | Word |
---|---|---|
1065 | 37 | მ³/წმ |
4238 | 10 | მ/წმ |
6944 | 6 | კმ/სთ |
6994 | 6 | მ.კუბ./წმ. |
9257 | 4 | 1/3 |
11815 | 3 | 1/4 |
12058 | 3 | AC/DC |
13473 | 3 | თოუნი/ვალიშ |
14149 | 3 | მგ/ლ |
16306 | 2 | 1/10 |

In the last subsection of this type we look for words containing other special characters: , ( ) % & $ " ' + * = / _
Depending on the language, some of these characters may be allowed within words; others are not. If words containing forbidden characters do not have a very low frequency, there might be a problem in preprocessing.
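
The check described above can be sketched in a few lines of Python. This is only an illustration, not the procedure used to produce this report: the function name `flag_suspicious`, the `allowed` set, and the `min_freq` threshold of 5 are assumptions chosen for the example, and the sample entries are copied from the tables in this section.

```python
# Characters examined in this subsection (taken from the text above).
SPECIAL_CHARS = set(",()%&$\"'+*=/_")


def flag_suspicious(entries, allowed=frozenset(), min_freq=5):
    """Return (word, freq, offending_chars) for every word that contains a
    special character not listed in `allowed` and that occurs at least
    `min_freq` times. Both parameters are hypothetical and language-dependent."""
    forbidden = SPECIAL_CHARS - set(allowed)
    flagged = []
    for word, freq in entries:
        hits = sorted(forbidden.intersection(word))
        if hits and freq >= min_freq:
            flagged.append((word, freq, "".join(hits)))
    return flagged


# A few (word, frequency) pairs copied from the tables in this section.
sample = [("%,", 53), ("2,5", 12), ("R&B", 1), ("Don't", 2), ("მ³/წმ", 37)]

# Assumption for this example: apostrophe and slash are acceptable inside words.
result = flag_suspicious(sample, allowed={"'", "/"}, min_freq=5)
for word, freq, chars in result:
    print(f"{word}\t{freq}\t{chars}")
```

With these assumed settings, only `%,` and `2,5` are reported: `R&B` falls below the threshold, and the other two words contain only characters that were declared acceptable.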
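Query for words containing % (the literal % is escaped as \% because % is also the LIKE wildcard; w_id-100 is reported as the rank in the wordlist):
select w_id-100, freq, word from words where w_id>100 and word like "%\%%" limit 10;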
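
The same query can be parameterized over the other special characters. The sketch below uses an in-memory SQLite database as a stand-in for the corpus database, so it makes several assumptions: the table layout `words(w_id, freq, word)` is inferred from the query above, the helper names `like_pattern` and `words_containing` are hypothetical, the `ESCAPE` clause is written out because SQLite has no default escape character, and the `ORDER BY w_id` is added only to make the output deterministic. The sample rows are copied from the tables in this section, with `w_id` set to rank + 100.

```python
import sqlite3


def like_pattern(ch, escape="\\"):
    """Build a LIKE pattern that matches words containing `ch` literally.
    The LIKE wildcards % and _ (and the escape character) are escaped."""
    if ch in ("%", "_", escape):
        ch = escape + ch
    return f"%{ch}%"


def words_containing(conn, ch, limit=10):
    """Return (rank, freq, word) rows for words that contain `ch`."""
    sql = (
        "SELECT w_id - 100, freq, word FROM words "
        "WHERE w_id > 100 AND word LIKE ? ESCAPE '\\' "
        "ORDER BY w_id LIMIT ?"  # ORDER BY is an addition for this sketch
    )
    return conn.execute(sql, (like_pattern(ch), limit)).fetchall()


# In-memory stand-in for the corpus database (assumed schema).
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE words (w_id INTEGER PRIMARY KEY, freq INTEGER, word TEXT)")
conn.executemany(
    "INSERT INTO words VALUES (?, ?, ?)",
    [
        (818, 53, "%,"),
        (2153, 20, "%."),
        (3238, 13, "80%"),
        (3497, 12, "2,5"),
        (32315, 1, "R&B"),
    ],
)

for ch in ["%", ",", "&", "_"]:
    print(repr(ch), words_containing(conn, ch))
```

Escaping matters most for `_` and `%`: an unescaped `_` in a LIKE pattern matches any single character, so the query would return every word rather than only those that contain an underscore.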
3.12.1 Words with Hyphens
3.12.2 Multiwords
3.12.3 (Multi-)Words with dots